On random sequence spacing distributions
نویسنده
چکیده
The random model for allocation of an amino acid A at locations along a protein chain is controlled by an underlying binomial distribution for occurrence of A, with density given by the relative abundance of A. Here we derive analytically the distribution of lengths of inter-A spaces in sequences of arbitrary length n, with arbitrary relative abundance p of occurrencies of A. This provides a well-defined reference structure for comparison with observations. We derive also the distribution function for inter-AA spaces. It turns out that the standard deviation of inter-A space length is approximately proportional to the mean inter-A space length, independently of sequence length n and relative abundance p. This means that, to the extent that the space length can be approximated by a continuous random variable, the distribution of space lengths is represented by a gamma distribution. The significance of this latter approximation is that the gamma family may provide models for spacing length distributions when the underlying process is not binomial, for example, when clustering or evening out of amino acids occurs. The new results show that actual sequences of amino acids also do have standard deviation of spacing lengths proportional to the mean but all exhibit more self-clustering than expected for finite sequences from a random process.
منابع مشابه
Series expansions for level-spacing distributions of the Gaussian unitary random matrix ensemble
The Open University's repository of research publications and other research outputs Series expansions for level-spacing distributions of the Gaussian unitary random matrix ensemble Journal Article How to cite: Grimm, Uwe (2004). Series expansions for level-spacing distributions of the Gaussian unitary random matrix ensemble. Physica status solidi b: basic research, 241(9) pp. 2139–2147. For gu...
متن کاملLevel-spacing distributions of the Gaussian unitary random matrix ensemble
Level-spacing distributions of the Gaussian Unitary Ensemble (GUE) of random matrix theory are expressed in terms of solutions of coupled differential equations. Series solutions up to order 50 in the level spacing are obtained, thus providing a very good description of the small-spacing part of the level-spacing distribution, which can be used to make comparisons with experimental or numerical...
متن کاملSpacing distributions in random matrix ensembles
The topic of spacing distributions in random matrix ensembles is almost as old as the introduction of random matrix theory into nuclear physics. Both events can be traced back to Wigner in the mid 1950’s [37, 38]. Thus Wigner introduced the model of a large real symmetric random matrix, in which the upper triangular elements are independently distributed with zero mean and constant variance, fo...
متن کاملWigner surmise for mixed symmetry classes in random matrix theory.
We consider the nearest-neighbor spacing distributions of mixed random matrix ensembles interpolating between different symmetry classes or between integrable and nonintegrable systems. We derive analytical formulas for the spacing distributions of 2×2 or 4×4 matrices and show numerically that they provide very good approximations for those of random matrices with large dimension. This generali...
متن کاملCorrelation Functions, Cluster Functions, and Spacing Distributions for Random Matrices
The usual formulas for the correlation functions in orthogonal and symplectic matrix models express them as quaternion determinants. From this representation one can deduce formulas for spacing probabilities in terms of Fredholm determinants of matrix-valued kernels. The derivations of the various formulas are somewhat involved. In this article we present a direct approach which leads immediate...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005